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Abstract 

Mayer's theory of cluster integrals allows one to write the partition function of a gas model 
as a generating function of weighted graphs. Recently, Labelle, Leroux and Ducharme have 
studied the graph weights arising from the one-dimensional hard-core gas model and noticed 
that the sum of the weights over all connected graphs with n vertices is (— n) n . This is, up to 
sign, the number of rooted Cayley trees on n vertices and the authors asked for a combinatorial 
explanation. The main goal of this article is to provide such an explanation. 



1 Introduction 

In [5], Mayer used an algebraic identity in order to express the partition function of a gas 
model as a generating function of weighted graphs. By Mayer's transformation, any choice of an 
interaction potential between particles in the gas leads to a specific graph weight. For instance, 
in the case of the one-dimensional hard-core gas, Labelle, Leroux and Ducharme [6] have shown 
that the Mayer's weight of a connected graph G having vertex set V(G) — {0, ...,n} and 
edge set E{G) is w{G) = (— 1)' b ( g )'Vo1(IIg) where VoI(ITg) is the volume of the n-dimensional 
polytope 

II G = {(xi, . . . ,x n ) 6 K™/ x Q — and \xt - Xj\ < 1 for all edge e E{G)}. 
The pressure in the model is related to Mayer's weights by 

_ Z \V(G)\ 

P = kT £ -(G)™,, (1) 

G connected graph 

where k is Boltzmann's constant, T is the temperature and z is the activity. 

It is known (see [5]) that the pressure of the hard-core gas is P = kTL(z), where L(z) is 
the Lambert function defined by the functional equation L(z) = zexp(— L(z)). Comparing this 
expression of the pressure with (|TJ) and extracting the coefficient of z n+1 gives 

J2 AG) = (-1)> + 1)», (2) 
Gec„ 

where the sum is over all connected graphs with n + 1 vertices. Labelle et al. observed that the 
right-hand-side of ^ is, up to sign, the number of rooted Cayley trees with n + 1 vertices and 
asked for a combinatorial explanation [6, Question 1]. The main purpose of this paper is to give 
such an explanation. 



1 



The outline of the paper is as follows. In Section ® we briefly review Mayer's theory of 
cluster integrals following the line of [8]. We illustrate this theory on a very simple model of 
discrete gas and prove the equivalence of this model with the Potts model on the complete 
graph. Comparing two expressions of the pressure in the discrete gas leads to a surprising com- 
binatorial identity. In Section [3) we give a combinatorial proof of this identity. In Section [4j we 
recall Mayer's setting for the hard-core continuum gas and then give a combinatorial proof of 
Equation @ , thereby answering the question of Labelle et al. 

We close this section with some notations. We denote by Z the set of integers and by K the 
set of real numbers. We denote [n] — {1, . . . ,n} and by & n the set of permutations of [n]. In 
this paper, all graphs are simple, undirected and labelled. Let G be a graph. We denote by v(G), 
e(G) and c(G) respectively the number of vertices, edges and connected components of G. A 
graph H is a spanning subgraph of G if the vertex sets of H and G are the same while the edge 
set of H is included in the edge set of G; we denote H C G in this case. We denote by e = 
the edge with endpoints i an j and write e £ G if the edge e belongs to G. For any edge e, we 
denote by G © e the graph obtained from G by either adding the edge e if e ^ G or by deleting 
this edge if e e G. 



2 Review of Mayer's theory of cluster integrals 

Consider a gas made of n (indistinguishable) particles in a vessel Slclf We suppose that the 
gas is free from outside influence and that interaction between two particles i and j at positions 
Xi and Xj is given by the potential 4>(xi,Xj). In the classical Boltzmann setting, the probability 
measure of a configuration is proportional to exp(— TJ/fcT), where k is Boltzmann's constant, T 
is the temperature and H is the Hamiltonian of the system given by 

l<z<n l<i<j<n 

2 

where Xi,Vi, m,i and — ^ are respectively the position, velocity, mass and kinetic energy of the 
i th particle. 

The partition function of the gas model is 

Z(Sl,T,n) = * ([ exp(-H/kT)dx! . . .dx n dvx . . .dv n , 

11 n - JJxi,...,x n eU, vi,...,v n £M d 

where h is Planck's constant. After integrating over all possible velocity, the partition function 
becomes 

where A depends on the temperature T. 

Mayer noticed that the partition function can be decomposed into a sum over graphs. Indeed, 
by setting f(xi, Xj) = exp(-^^) — 1, one gets 
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where the sum is over all graphs on n vertices (equivalently, spanning subgraph of the complete 
graph K n ) and the inner product is over all edges of G. In terms of the partition function, this 
gives Mayer's relation: 



\ n n\Z(tl,T,n) = jj J] exp(- W . . . dx n = £ W{G), 

" i<j ^ ' GCK n 

where W(G) — JJ f(xi, Xj)dx\ . . . dx n is the first Mayer's weight of the graph G. 

n " (i,i)eG 

Example: the discrete gas. Suppose f2 is made of q distinct boxes B\, . . . , B q of volume 1 
and that the interaction potential <p{xi, Xj) is equal to a if the particles i and j are in the same 
box and otherwise. By definition, the Mayer's weight of a graph G is 

W(G)= // JJ f(x i ,Xj)dx 1 ...dx n = E // Y\_f(x i ,x j )dx 1 . . .dx n . 

We denote u = exp(— a/kT) and observe that f(xi,Xj) = u — 1 if i and j are in the same box 
and otherwise. Therefore, the product JX, j)ec f( x i' x j) equals ( u — l) e ^ G ^ if the value of c 
is constant over each connected components of the graph G and otherwise. Summing over all 
mappings c : [n] i— > [q] gives 

W{G) = q c{G \u - l) e< - G \ 

since there are q c ( G ^ mappings c : [n] t— » [q] which are constant over each connected components 
ofG. 

In our discrete gas example, a direct calculation of the partition function gives 
\ n n\Z(n,T,n)= £ ff JJexpf-^^)^...^^ £ 



c:[n]i-»[ 9 ] "^ieBc(l)-i n EBc(„) j<j ' ■'- . c:[n]^[q] 



6(c) 



where S(c) is the number of edges € if n such that c(i) = c(j). Hence, Mayer's relation 

reads 

J2 ^' (c) - E 9 c(G) («-l) e(G) - (3) 

c:[«]h->[g] GCif„ 

Equation ^ is a special case of the equivalence established by Fortuin and Kastelein 0] between 
the partition function of the Potts model (see e.g. [1]) and the Tutte polynomial (see e.g. [2]). 
Indeed, the right-hand-side corresponds to the partition function of the Potts model on the 
complete graph K n while the left-hand-side corresponds to the subgraph expansion of the Tutte 
polynomial of K n up to scaling and change of variables. The relation of Fortuin and Kastelein 
is the generalisation of ^ obtained by replacing the complete graph K n by any graph H . This 
more general case relies on the observation that Y[(i j)eH fa>i = Egch Yi(i j)eG fi>o as soon as 
<t>i,j = 1 + fi,j for all e H. 



We now return to the general theory of Mayer and consider a system with an arbitrary 
number of particles. The grand canonical partition function is defined by 

Z gI (z) = Z gI (n,T,z) = £VA n Z(fi,T,n), 

n<0 
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where z is the activity of the system. In terms of Mayer's weights, the grand canonical partition 
function is the exponential generating functions of graphs weighted by their first Mayer's weight: 

~n 7 v(G) 

= E^ E W(G) = £^(G)™ 

n<0 ' GCK n G y '' 

The macroscopic parameters of the systems, such as the density p, or pressure P, can be 
obtained from Z gv (z) by the relations 

kT z d 

p = pi 1os ( z &W) and p = pi ~q~ z l °s( z ^( z ))- 

Observe that the first Mayer's weight is multiplicative over connected components, that is, 
if a graph G is the disjoint union of two graphs G\ and G2 then W{G) = W(G\)W(G-z). This 
is the key property implying log(Z gr (z)) = J2g colmoctcd W(G)z v ^ (see [6] for a complete 
proof), or equivalently, 

kT ^ z v ^ 
P = Wl S <"» 

G connected 



Example: the discrete gas. For the discrete gas model introduced before, Equation ((4| gives 

p MG) 

w- S (»-i)- <C) ^- < 5 ) 

G connected 

In the special case of an infinite repulsive interaction between particles in the same box, that is, 
a = 00 and u — 0, the pressure P can also be computed directly. Indeed, in this case, one gets 

X n n\Z{Q,T,n) = £ M<5(c) = #{c : [n] •-> [3] injective} = g(g - 1) • • • (g - n + 1), 

c:[n] 

and 

" " / "' (! + *)« 



Z gr (z) = £VA»Z(n,T,n) = W«) 

n>0 ti>0 ^ ' 



This expression for the grand canonical partition function comes to no surprise since each of the 
q boxes contains either nothing (activity 1) or one particle (activity z). Now, 



71-1 

z % 



-| = ilog(Z gr (z)) = log(l + z) = J2- " 

* n>0 

and extracting the coefficient of z n in both side of ([5]) gives 

E ^ 1 ) G(G) = ( 6 ) 

GQK n connected 

Identity ^ is quite surprising at first sight but can be understood by recognising in the 
left-hand-side the evaluation of the Tutte polynomial of K n counting the root-connected acyclic 
orientations (acyclic orientation in which the vertex 1 is the only source) [5]. Indeed, in the case 
of the complete graph K ni root-connected acyclic orientations are linear orderings of [n] in which 
1 is the least element, or equivalently, permutations of {2, . . . ,rt}. In the next section, we give 
a combinatorial proof of Equation ^ which avoids introducing the whole theory of the Tutte 
polynomial (though it is based on it) and prepares for the more evolved proof of Equation ([2]). 
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3 Pressure in the hard-core discrete gas and increasing 
trees 



In this section, we give a combinatorial proof of ([6]) by exhibiting an involution $ on connected 
graphs which cancels the contribution of almost all graphs in the sum E] (— l) e< - G - ) . 

GQK n connected 

We consider the lexicographic order on the edges of K n defined by < (k,l) if either 

min(i, j) < min(/c,Z) or min(i,j) = min(fcJ) and max(i, j) < max(fc,/). For a graph G and an 
edge e = (not necessarily in G), we denote by G >e the spanning subgraph of G made of 

the edges which are greater than e. We say that e = is G-active if there is a path in G >e 
connecting i and j and we denote by e* G the least G-active edge (if there are some). We then 
define a mapping \& on the set of connected graphs by setting: ^f(G) = G if there is no G-active 
edge and ^(G) = G © e* G otherwise. 

Lemma 1 The mapping is an involution on connected graphs. 

Proof: • First observe that the image of a connected graph is connected. Indeed, if the edge e* G 
exists and belongs to G, then it is in a cycle of G and deleting it does not disconnect G. 

• We now prove that any edge is G-active if and only if it is ^(G) -active. Suppose that the edge 
e = (i,j) is G-active and let P be a path of G >e connecting i and j. Since e* G < e the path P 
does not contain e G , hence P C \P(G) >e and e is \E'(G)-active. Suppose conversely that e = (i,j) 
is vp(G)-active and let P be a path of ^(G) >e connecting i and j. UP does not contain e G , then 
P C G >e and e is G-active. Otherwise, e* G > e and there is a path Q of G >Gg C G >e connecting 
the endpoints of e* G . Thus, there is a path contained in (P — e* G ) U Q C G >e connecting i and 
j and again e is G-active. 

• By the preceding point, there is a G-active edge if and only if there is a ^(G^active edge and 
in this case e^ (G) = e* G . Thus, *(*(G)) = G. □ 

The mapping W is an involution and (— l) e ( G ) + (— l) e (*( G )) = whenever G ^ ^(G), hence 

E (-i) e(G) = E (-!) e(G) - ( 7 ) 

GCJf„ connected GCA'„ connected, * (G) = G 

We now characterise the fixed points of the involution \E'. A tree on {1, . . . , n} is said increasing 
if the labels of the vertices are increasing along any simple path starting from the vertex 1. 

Lemma 2 A connected graph G has no G-active edge if and only if it is an increasing tree. 

Proof: • We suppose first that G is an increasing tree and want to prove that no edge is 
G-active. Since G has no cycle, no edge in G is G-active. Consider now an edge e = ^ G 
and the nearest common ancestor k of i and j (the root vertex of G being the vertex 1). There 
is an edge e' — {k,l) containing k on the path of G connecting i and j. Since G is an increasing 
tree, k < min(i, j) and I < max(i,j). Thus, e' = (k,l) < e — (i,j) and e is not G-active. 

• Suppose now that there is no G-active edge. First observe that G is a tree since if G had a cycle 
then the minimal edge in this cycle would be active. We now want to prove that the tree G is 
increasing. Suppose the contrary and consider a sequence of labels 1 = i\ < i2 < ■ ■ ■ < i r > V+i 
on a path of G starting from the vertex i\ = 1. Then, the edge (i, — i, i r +\) is G-active and we 
reach a contradiction. □ 

By Lemma [2] the fixed points of the involution are the increasing trees. The increasing 
trees on {1, . . . , n} are known to be in bijection with the permutations of {2, . . . , n} |10j . Hence, 
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there are (n — 1)! increasing trees on [n] and continuing Equation ([7]) gives 

^ (-l) e ( G ) = ^(_l)e(G)+n-i = ^{i ncr easing trees on [n]} = (n - 1)!. 

GCK n COimOCtcd GCK„ connected 

*(G)=G 

This completes the proof of Equation {g} . 

4 Pressure in the hard-core continuum gas and Cayley 
trees 

In the 1-dimensional hard-core continuum gas, the vessel is an interval = [—q/2, q/2] and the 
potential of interaction between two particles i and j is <j){x%,Xj) = oo if \xi — Xj\ < 1 and 
otherwise. By definition, f(xi,Xj) = ex^(—<f>(xi,Xj)/kT) — 1 is equal to -1 if \xi — Xj\ < 1 and 
otherwise. Thus, the first Mayer weight of a graph G on n vertices, is 

TT f{x i ,Xj)dx 1 ...dx n = (-l)<V [[ TT li^.^i. 

9 1]n /J[ 9 Sift . 

2'2l (i.j)EG JJ l—S'i\ (i,j)eG 

In the thermodynamic limit where the volume q of the Vessel tends to infinity, it becomes 
interesting to consider the second Mayer's weight of connected graphs defined by w(G) — 
lim g ->oo w ( q 'T> G ) anc l related to the pressure by P = kTJ2c connected w(G)z G . In [6], it is 
shown that for any connected graph G on {0, . . . ,n}, the second Mayer weight w(G) equals 
(— l) e ( G )Vol(ri(3), where VoI(IIg) is the volume of the n-dimensional polytope 

II G = {(xi, . . . ,x n ) £ W 1 / x — and \xf — Xj\ < 1 for all edges 6 G}. 

For instance, the polytope Hk 3 is represented in Figure [TJ The rest of this paper is devoted to 
the proof of Equation (|2|) given in the introduction. 
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Figure 1: The polytope ilx 3 (dashed) and its decomposition into subpolytopes. 

Subpolytopes. As observed by Bodo Lass [7], it is possible to decompose the polytope ITg 
into subpolytopes of volume l/n\. Each subpolytope is defined by fixing the integral parts and 
the relative order of the fractional parts of the coordinates x%, . . . ,x n . Let us first recall some 
definitions: for any real number x we write x = h(x) + e(x) where h{x) £ Z is the integral part 
and < e(x) < 1 is the fractional part. Given a vector of integers h = (hi, . . . , h n ) S Z" and 
a permutation a 6 & n , we denote by 7r(h, a) the polytope whose interior is made of the points 
(x\, . . . ,x n ) such that h(xi) = hi for alii = 1 . . .n and < e(x cr -in\) < ■ ■ ■ < e(x cr -i( n )) < 1. 

In particular, the polytope 7r(h, a) contains the point (h\ + • • • , h n + ^p^j ) in its interior. 



W(q,T,G) = 
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Observe that the condition Xj| < 1 is equivalent to h(xi) — h(xj) G {0, siga(e(xj)— e(xi))} 
where the value of sign(a;) is -1 if as < 0, +1 if x > and if x = 0. Therefore, a point 
(x\, . . . ,x n ) in the interior of the polytope 7r(h, a) is in Hq if and only if G G implies 

hi — hj G {0, sign(<j(j) — <x(i))} with the convention that ho = and <r(0) = 0. This condition 
only depends on the pair (h, a), therefore either the polytope 7r(h, a) is included in the polytope 
He or the interiors of the two polytopes are disjoints. Moreover, a simple calculation shows that 
the volume of the polytope 7r(h, a) is 1/n! for all h £ Z",er G & n . This proves the following 
lemma. 

Lemma 3 For any connected graph G on {0, ...,n} 7 the value n\Vol(Ho) counts the pairs 
h £ U 1 ,o~ G & n such that ir(h, a) is a subpolytope o/IIq. 

For example, the polytope Hk 3 represented in Figure [T] contains 6 subpolytopes ~k\ — 
tt((-1, -1), 12), tt 2 = 7r((-l, -1), 21), ir 3 = vr((0, -1), 12), 7r 4 - tt((-1, 0), 21), n 5 = tt((0, 0), 12), 
ttq = 7r((0,0),21) each having volume 1/2. 



Rearrangement. Summing the second Mayer's weights over connected graphs and using 
Lemma [3] gives 

£ w(G) = £ (-ir( G >Voi(n G ) = I 2 (- 1 ) e(G) - ( 8 ) 

GeC„ GeC„ ' heZ",<reS„,GGC„ / 7r(h,CT)Cn G 

Let cr be a permutation of [n]. For any vector h = (hi, . . . , h n ) in Z™, we denote c(h) = 
(/lo-m, . . . , /i CT ( n )). For any graph G labelled on {0, . . . , n}, we denote by cr(G) the graph where 
the label i is replaced by a(i) for alH = 1, . . . , n. 

Lemma 4 Let h be a vector in Z n , let a be a permutation of [n] and let G be a graph. Then, 
Tt(h,a) C Llfj if and only if "n(o~~ l (h),Id) C IL^c?)* where Id is the identity permutation. 

We omit the proof of Lemma2]which is straightforward. From this Lemma, one gets for any 
permutation a of [n], 

(-i) e(G) = ^ (-i) e(G) = v (-i) e ( ff_i ( G )) = v (-i) e(G) . 

h£Z", GEC„ heZ™, GEC„ heZ™, GEC„ h£Z™, Gec„ 

»(i,«)£n G »(o--i(h),id)cn <T(G) x(h,id)cn G T (h,id)cn G 

where the second equality is obtained by changing the order of summations on the graphs G 
and on the vectors h. Therefore, continuing Equation (JSJ) gives 

£ = i £ £ (-i) e(G) = e (-D^ G ). o) 

GeC„ <tGS„ hez™, Gec„ hez™, g<=c„ 

T(h,<r)cn G «(h,id)cn G 



Killing involution. Let h be a vector in Z™ . We will now evaluate the sum (— l) e ' ' 

GeC„ / 7r(hJd)Cn G 

thanks to a killing involution similar to the one defined in Section [3J To the vector h = 
(hi, . . . , h n ) we associate the centroid h = (ho, hi, ... , /i„), where hi = hi + for i = 0, . . . , n 
with the convention that ho = 0. We also denotes by Gh the graph on {0, . . . , n} whose edges 
are the pairs (i,j) such that \hi — hj\ < 1. Observe that for any graph G, 7r(h,Id) C TLq if and 
only if (hi, . . . , h n ) G Hg if and only if G C Gh- 



7 



We order the edges e = of the graph Gh by the lexicographic order on the correspond- 
ing pairs (hj,hj), that is, (i,j) < (k,l) if either rnin(hj,hj) < min(hfc,hi) or min(hi,hj) = 
min(hfc, h;) and max(hi, h,-) < max(hfe, hi). For a graph G C Gh and an edge e = (i,j) in Gh, 
we denote by G >e the set of edges in G that are greater than e and we say that e is (G, h)-active 
if there is a path in G >e connecting i and j. We also denote by e* G h the least (G, h)-active edge 
if there is any. We then define a mapping on the set of connected spanning subgraphs of Gh 
by: x f'h(G) = G if there is no G-active edges and 'I'h(G) = G © e* G h otherwise. 

Lemma 5 For any vector h in Z™, the mapping is an involution on the connected subgraphs 
ofG h . 

The proof of Lemma [5] is identical to the proof of Lemma [1] As a consequence, one gets 

E (-i) e(G) = E i- 1 ^- 

GCGh connected GCGh connected, ^ h (G)— G 

We now characterise the fixed points of "Jh- Let io be the index of the least coordinate of 

the centroid h (that is, hi = min ie { n }(h>i))- A tree on {0, ...,n} is said h-increasing if 

the labels io, ii, ■ ■ ■ ,ik of the vertices along any simple path starting at vertex io are such that 
h ia < h i± < ■ ■ ■ < h ik . 

Lemma 6 Let h be a vector in Z™. A connected graph G C Gh is an h-increasing tree if and 
only if there is no (G, h)-active edge. 

Proof: • We suppose first that G is an h-increasing tree. Since G has no cycle, no edge in G is 
active. Consider now an edge e = ^ G and the nearest common ancestor k of i and j (the 
root vertex of G being the vertex io). Let also e' = (fc, I) be an edge containing k on the path 
in G between i and j. Since G is an h-increasing tree, h^ < min(hj, hj) and h/ < max(hj, hj). 
Thus, e' = (k,l) < e = (i,j) and e is not (G, h)-active. 

• Suppose now that there is no (G, h)-active edge. First observe that G is a tree since if G had 
a cycle then the minimal edge in this cycle would be active. We now want to prove that the 
tree G is h-increasing. Suppose the contrary and consider a sequence of labels io,«i, • • • , Vj V+i 
such that hi < ■ ■ ■ < hi r _ 1 < hi T , hi T+1 on a path of G starting from the vertex io- Then, the 
edge (i r -i) ir+i) belongs to Gh (since \hi r l — hj r+1 \ < max(|/ii r l — hi r \, \hi r — hi r+1 1) < 1) and 
is (G, h)-active. We reach a contradiction. □ 

From Lemma HI one gets for any vector h 6 Z" , 

E (-l) e(G) = {h-increasing trees}. 

GCGh connected 

Therefore, continuing Equation ([9]) gives 

E w{G) = E E (-l) e(G) = (-1)" E # {h-increasing trees}. (10) 

G£C„ heZ" GCGh connected hCZ" 



Cayley trees. We now relate h-increasing trees and Cayley trees. 

Lemma 7 Any rooted Cayley tree on {0, . . . , n} with root io is h-increasing for exactly one 

vector h in Z™ such that hi — min (hi). 

te{o,...,n} 
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Proof: Let T be a Cayley tree rooted on «o- The tree T is h- increasing with hi = minjg.ro,. ...nlC^i) 
if and only if any vertex j ^ ig satisfies hi < hj < hi + 1, where i is the father of j. The condition 
hi < hj < hi + 1 holds if and only if either i < j and hj — hi or j < i and hj = hi + 1. Therefore, 
tree T is h-increasing with hi = min(/ii) if and only if for all index i = 0, . . . , n, the difference 
hi — hi is the number of descents in the sequence of labels io,i±, . . . ,i s = i along the path of T 
from io to i (a descent is an index r < s such that i r+ i < i r ). Knowing that ho — completes 
the proof. □ 

It is well known that the number of rooted Cayley trees on {0, . . . ,n} is (n + 1)". Thus, 
Lemma [7] gives 

w{G) = (-1)" # h-increasing trees = (-l) n (n + 1)". (11) 
GeC„ hez™ 

This completes the proof of Equation (|2|) and answers the question of Labelle et al. [6] . 
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